Data Swapping: A Risk-Utility Framework and Web Service Implementation
نویسندگان
چکیده
Data swapping makes it impossible for an intruder to be certain of having identified an individual or entity in the database, because no record is certain to be unaltered. At the same time, the data are distorted (joint distributions involving both swapped and unswapped attributes change), decreasing their utility for purposes such as statistical inference. Implementation of data swapping entails selection of swap attributes, the swap rate (fraction of records for which swapping occurs) and, possibly, constraints on unswapped attributes [3]. In our risk-utility framework, each candidate release is characterized by numerical values of disclosure risk and utility. See §5 for examples. A statistical agency would like to select a release that has both minimum risk and maximum utility, but ordinarily this is not possible: as shown in Figure 1, higher utility entails higher risk. Nevertheless, not all releases are sensible: any release dominates all other releases that have both lower utility and higher risk (which lie to its northwest in Figure 1), so that the choice should be made from the frontier (in economics, the efficient frontier) of undominated releases. Selection of a release on the frontier can be done by assessing the risk-utility balance subjectively or quantitatively, by means of an objective function that relates risk and utility. Figure 1 illustrates for a linear risk-utility tradeoff.
منابع مشابه
High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملA Risk-Utility Framework for Categorical Data Swapping
Data swapping is a statistical disclosure limitation method used to protect the confidentiality of data by interchanging variable values between records. We propose a risk-utility framework for selecting an optimal swapped data release when considering several swap variables and multiple swap rates. Risk and utility values associated with each such swapped data file are traded off along a front...
متن کاملA customer oriented systematic framework to extract business strategy in Indian electricity services
Competition in the electric service industry is highlighting the importance of a number of issues affecting the nature and quality of customer service. The quality of service(s) provided to electricity customers may be enhanced by competition, if doing so offers service suppliers a competitive advantage. On the other hand, service quality offered to some consumers could decline if utilities foc...
متن کاملAdaptive Information Analysis in Higher Education Institutes
Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...
متن کاملAdaptive Information Analysis in Higher Education Institutes
Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003